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Abstract 

On the multi-antenna broadcast channel, the spatial degrees of freedom support simultaneous transmission 
to multiple users. The optimal multiuser transmission, known as dirty paper coding, is not directly realizable. 
Moreover, close-to-optimal solutions such as Tomlinson-Harashima precoding are sensitive to CSI inaccuracy. 
This paper considers a more practical design called per user unitary and rate control (PU2RC), which 
has been proposed for emerging cellular standards. PU2RC supports multiuser simultaneous transmission, 
enables limited feedback, and is capable of exploiting multiuser diversity. Its key feature is an orthogonal 
beamforming (or precoding) constraint, where each user selects a beamformer (or precoder) from a codebook 
of multiple orthonormal bases. In this paper, the asymptotic throughput scaling laws for PU2RC with a large 
user pool are derived for different regimes of the signal-to-noise ratio (SNR). In the multiuser-interference- 
limited regime, the throughput of PU2RC is shown to scale logarithmically with the number of users. In 
the normal SNR and noise-limited regimes, the throughput is found to scale double logarithmically with the 
number of users and also linearly with the number of antennas at the base station. In addition, numerical 
results show that PU2RC achieves higher throughput and is more robust against CSI quantization errors than 
the popular alternative of zero-forcing beamforming if the number of users is sufficiently large. 

I. Introduction 

In multi-antenna broadcast channels, simultaneous transmission to multiple users, known as multiuser 
multiple-input-multiple-output (MIMO) or space division multiple access (SDMA), is capable of achieving 
much higher throughput than other multiple-access schemes such as time division multiple access (TDMA) 
[1]. Due to this advantage, SDMA has been recently included in the IEEE 802. 16e standard [2], and has 
been proposed for the emerging 3GPP long term evolution (LTE) standard [3]-[6]. While the optimal SDMA 
strategy is known, dirty paper coding [7] is non-causal and hence not directly realizable. Moreover, close- 
to-optimal techniques such as Tomlinson-Harashima precoding and vector perturbation are sensitive to CSI 
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inaccuracy [8], [9]. More practical SDMA algorithms are based on transmit beamforming, including zero 
forcing [10]— [13], a signal-to-interference-plus-noise-ratio (SINR) constraint [14], minimum mean squared 
error (MMSE) [15], and channel decomposition [16]. These SDMA algorithms can be combined with 
multiuser scheduling to further increase the throughput by exploiting multiuser diversity, which refers to 
scheduling only a subset of users with good channels for each transmission [17]-[23]. Both scheduling and 
beamforming in a SDMA system require channel state information (CSI) at the base station. Unfortunately, 
CSI feedback from each user potentially incurs excessive overhead because of the multiplicity of channel 
coefficients. Therefore, this paper focuses on SDMA that supports efficient CSI feedback and uses CSI for 
joint beamforming and scheduling. 

A. Related Work and Motivation 

In this paper, we consider a practical scenario where partial CSI is acquired by the base station through 
quantized CSI feedback, known as limited feedback [24]. Quantized CSI feedback for point-to-point com- 
munications has been extensively studied recently (see e.g. [24], [25] and the references therein). The effects 
of CSI quantization on a SDMA system have been investigated in [20], [26], [27]. The key result of [20] is 
that the number of CSI feedback bits can be reduced by exploiting multiuser diversity. In [26], combined 
quantized CSI feedback and zero-forcing dirty paper coding are shown to attain most of the capacity achieved 
by perfect CSI feedback. In [27], it is shown that for a small number of users the number of CSI feedback 
bits must increase with the signal-to-noise ratio (SNR) to ensure that the throughput grows with SNR. 

This paper addresses joint beamforming and scheduling for SDMA systems to maximize throughput, 
assuming backlogged users. A similar scenario but with bursty data and the objective of meeting quality-of- 
service (QoS) for different users is addressed in [28] and references therein. The optimal approach for our 
full-queue scenario involves an exhaustive search, where for each possible subset of users a corresponding 
set of beamforming vectors is designed using algorithms such as that proposed in [14]. The main drawback 
of the optimal approach is its complexity, which increases exponentially with the number of users. This 
motivates the designs of more efficient SDMA algorithms. 

In [22], a practical SDMA algorithm, called opportunistic SDMA (OSDMA), is proposed, which supports 
low -rate beamforming feedback and satisfies the orthogonal beamforming constraint. As shown in [22], for 
a large number of users, an arbitrary set of orthogonal beamforming vectors ensures that the throughput 
increases with the number of users at the optimal rate. Nevertheless, for a small number of users, such 
arbitrary beamforming vectors are highly sub-optimal due to excessive interference between scheduled users. 
To reduce multiuser interference caused by sub-optimal beamforming vectors, an extension of OSDMA, 
called OSDMA with beam selection (OSDMA-S), is proposed in [21], where each mobile iteratively selects 
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beamforming vectors broadcast by the base station and sends back its choices. Due to distributed beam 
selection, numerous iterations of broadcast and feedback are required for implementing OSDMA-S, which 
incurs significant downlink overhead and feedback delay. As a result, the throughput gains of OSDMA-S 
over OSDMA are marginal. 

An alternative beamforming SDMA algorithm is proposed in [20], referred to as ZF-SDMA, where 
feedback CSI is quantized using the random vector quantization (RVQ) algorithm [27], [29] and greedy- 
search scheduling is performed prior to zero-forcing beamforming. A design similar to ZF-SDMA [6] has 
been proposed to the emerging 3GPP-LTE standard [3], which is the latest cellular communication standard. 
The drawback of ZF-SDMA is its lack of robustness against CSI inaccuracy due to the separate designs of 
the limited feedback, scheduling and beamforming sub-algorithms. 

In industry, SDMA with orthogonal beamforming, under the name per user unitary and rate control 
(PU2RC) [5], has been proposed to the 3GPP-LTE standard. The main feature of PU2RC is limited feedback, 
where multiuser precoders or beamformers are selected from a codebook of multiple orthonormal bases. 
Based on limited feedback, PU2RC supports SDMA, scheduling, and adaptive modulation and coding. 
Because of its versatility and advanced features, PU2RC is one of the most promising solutions for high- 
speed downlink in 3GPP-LTE. The importance of PU2RC for the next-generation wireless communication 
motivates the investigation of its performance in this paper. 

In this paper, we consider a simplified PU2RC system where scheduled users have single data streams, 
which are separated by orthogonal beamformers. In this case, PU2RC generalizes OSDMA [22] by allowing 
the beamforming codebook to contain more than one orthonormal basis. Such a generalization complicates 
the performance analysis of PU2RC because the resultant scheduler is more complicated. To be specific, the 
scheduler has to select an orthonormal basis from the codebook besides choosing a particular user for each 
codebook vector. Such a challenge motivates our use of a new analytical tool, namely uniform convergence 
in the weak law of large numbers [30], for analyzing the throughput of PU2RC instead of extreme value 
theory as applied in [22]. 

Theory of uniform convergence in the weak law of large numbers is also applied in our previous work 
[31] for analyzing the throughput of uplink SDMA with limited feedback. Despite using the same tool, the 
analysis in this paper differs from [31] due to differences between the uplink and downlink. Specifically, the 
received data signal for the downlink propagates through a single-user channel, but that for the uplink passes 
through multiuser channels. As a result, SINR feedback for downlink SDMA is infeasible for uplink SDMA, 
where SINR depends on multiuser CSI and is hence uncomputable at users. Consequently, downlink and 
uplink SDMA require different designs of scheduling algorithm. Thus, the joint beamforming and scheduling 
algorithm presented in this paper is not applicable for uplink SDMA. Interestingly, despite the differences 
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between the uplink and downlink, the asymptotic throughput scaling laws for downlink SDMA as derived 
in this paper are found to be identical to those for uplink SDMA [31]. 

B. Contributions and Organization 

The main contribution of this paper is the analysis of the throughput scaling of PU2RC for an asymp- 
totically large number of users U — ► oo. Using the theory of uniform convergence in the weak law of 
large numbers, throughput scaling laws are derived for three regimes, namely the normal SNR, interference- 
limited and the noise-limited regimes. In the normal SNR regime, both the variance of noise and multiuser 
interference are comparable; in the interference-limited regime, multiuser interference dominates over noise; 
the reverse exists in the noise-limited regime. Our main results are summarized as follows. In the interference- 
limited regime, we show that the throughput scales logarithmically with U but does not increase with the 
number of transmit antennas N t at the base station. In both the normal SNR and noise-limited regimes, 
we show that the throughput scales double logarithmically with U and linearly with Nt. This throughput 
scalinglaw shows that PU2RC achieves the optimal multiuser diversity gain as OSDMA in the normal SNR 
regimqj. Thereby, this result contradicts the intuition that using multiple orthonormal bases in the codebook 
splits the user pool and hence reduces the multiuser diversity gain. Using Monte Carlo simulations, the 
asymptotic throughput scaling laws are also found to hold in the non-asymptotic regime where U is finite. 

The asymptotic throughput analysis for PU2RC provides several guidelines for designing the scheduler 
to ensure optimal throughput scaling. First, in the interference-limited regime, scheduling should use the 
criterion of minimum quantization error. Second, in the normal SNR regime, scheduled users should have 
both large channel power and small quantization errors. Third, in the noise-limited feedback, scheduling 
should select users with large channel power while the quantization error is a less important scheduling 
criterion. 

Numerical results are presented for evaluating the throughput of PU2RC and also comparing PU2RC with 
ZF-SDMA. Several observations are made. First, increasing the amount of CSI feedback (or the codebook 
size) can decrease the throughput for PU2RC if the number of users is small. Otherwise, more CSI feedback 
provides a throughput gain. Second, PU2RC achieves higher throughput than ZF-SDMA for large numbers 
of users but the reverse holds for relatively small numbers of users. Third, decreasing the codebook size 
causes a larger throughput loss for ZF-SDMA than that for PU2RC. 

The remainder of this paper is organized as follows. The system model is described in Section ITT] The 
sub-algorithms of PU2RC for CSI quantization, and joint beamforming and scheduling are presented in 
Section |III] The asymptotic throughput scaling of PU2RC is analyzed in Section [TV] The performance 

'The interference-limited and noise-limited regimes have not been considered for OSDMA in [22] 



ACCEPTED FOR PUBLICATION IN IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, REVISED ON MAY 3, 2008 



5 



Vu = \l -rrhl V w n x n + u u , u € A, (1) 



of PU2RC is evaluated using Monte Carlo simulation in Section [V] followed by concluding remarks in 
Section [VT] 

II. System Model 

The downlink or broadcast system illustrated in Fig. Q] is described as follows. The base station with 
N t antennas transmits data simultaneously to N t active users chosen from a total of U users, each with 
one receive antenna. The base station separates the multiuser data streams by beamforming, i.e. assigning a 
beamforming vector to each of the N t active users. The beamforming vectors {w n }^ r l 1 are selected from 
multiple sets of unitary orthogonal vectors following the beam and user selection algorithm described in 
Section IIII-BI Equal power allocation over scheduled users is considered^. The received signal of the uth 
scheduled user is expressed as 

where we use the following notation 

Nt number of transmit antennas and also num- w u (Nt x 1 vector) beamforming vector with 

ber of scheduled users; ll w «l| 2 = 1; 

h u (Nt x 1 vector) downlink channel; A The index set of scheduled users; 

x u transmitted symbol with E^x^j 2 ] = 1; P transmission power; and 

y u received symbol; v u AWGN sample with v u ~ CJ\f(0, 1). 
f conjugate transpose matrix operation; 

For the purpose of asymptotic analysis of PU2RC, we make the following assumption: 
Assumption 1: The downlink channel h u V u = 1, 2, • ■ ■ , U is an i.i.d. vector with £/V(0, 1) coefficients. 
Given this assumption commonly made in the literature of SDMA and multiuser diversity [18], [19], [21], 
[22], [27], the channel direction vector h u /||h u || of each user follows a uniform distribution. Assumption [j] 
greatly simplifies the throughput analysis of PU2RC in Section|lV]but has no effect on the PU2RC algorithms 
in Section |nl] Assumption [T] is valid for the scenario where wireless channels have rich scattering and users 
encounter equal path loss. Throughput analysis for a more complicated channel model is a topic for future 
investigation. 

III. Algorithms 

In this section, we propose the algorithms for PU2RC including (i) limited feedback by the mobiles and 
(ii) joint beamforming and scheduling at the base station. The principles for these algorithms have been 

2 Note that equal power allocation is close to the optimal water-filling method if scheduled users all have high SINR. 
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described in the proposal of PU2RC [5] even though their details are not provided therein. The algorithms 
presented in the following sections are tailored for the system model in Section JI] The following discussion 
on algorithms serves two purposes: (i) to elaborate the operation of PU2RC and (ii) to establish an analytical 
model for the asymptotic throughput analysis in Section JV] 

A. Limited Feedback 

Without loss of generality, the discussion in this section focuses on the uth user and the same algorithm 
for CSI quantization is used by other users. For simplicity, we make the following assumption 

Assumption 2: The uth user has perfect CSI h w . 
This assumption allows us to neglect the channel estimation error at the nth mobile. For convenience, the 
CSI, h u , is decomposed into two components: the gain and the shape. Hence, 

h u = g u s u , u = 1, • • • , U, (2) 

where g u = \\h u \\ is the gain and s u = h u /||h u || is the shape. The nth user quantizes and sends back to the 
base station two quantities: the channel shape and the SINR. 

The channel shape s u is quantized using a codebook-based quantizer [32] with a codebook comprised 
of multiple sets of orfhonormal vectors in C^*. Let T denote the codebook, y( m ) the rath orthonormal 
set in the codebook, and M the number of such sets. Thus, T = Um=i and the codebook size is 
N = \F\ = MN t . For our design, the M orthonormal bases of T are generated randomly and independently 
using a method such as that in [33]. Following [34] and [35], the quantized channel shape, represented by 
s u , is the member of T that forms the smallest angle with the channel shape s u . Mathematically, 

s u = argmind(v,s M ), (3) 

where the distortion function d(v,s u ) is given as 

= sin 2 (Z(v,s M )). (4) 

It follows that the quantization error can be defined as e = sin 2 (Z(s u , s u )). It is clear that e = if \sts u \ = 1 
and e = 1 if s u _L s u . 

The quantized channel shape s u is sent back to the base station through a finite-rate feedback channel 
[24], [34]. Since the quantization codebook T can be known a priori to both the base station and mobiles, 
only the index of s n needs to be sent back. Therefore, the number of feedback bits per user for quantized 
channel shape feedback is log 2 N since \T\ = N. The number of additional bits required for SINR feedback 
is discussed in Section |Vl 



d(v,s u ) 



v+s, 
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Besides the channel shape, the nth user also sends back to the base station the SINR, which serves as a 
channel quality indicator. For orthogonal beamforming, the SINR is given as [20] 

SINR U = 7 ^ (1 ~ €u) , (5) 

where 7 = ^- is the SNR, e u the CSI quantization error, and p u = |[h n || 2 the channel power. Since the 
SINR is a scalar and requires much fewer feedback bits than the channel shape, we make the following 
assumption: 

Assumption 3: The SINR„ is perfectly known to the base station through feedback. 
The same assumption is also made in [21], [22]. The effect of SINR quantization on the throughput is shown 
to be insignificant using numerical results in Section [V] 

B. Joint Scheduling and Beamforming 

This section focuses on the joint scheduling and beamforming algorithm designed based on the principles 
of PU2RC. Having collected quantized CSI from all U usersj, the base station schedules Nt users for 
transmission and computes their beamforming vectors. To maximize the throughput, Nt scheduled users 
must be selected through an exhaustive search, which is infeasible for a large user pool. Therefore, we adopt 
a simpler joint scheduling and beamforming algorithm. In brief, this algorithm schedules a subset of users 
with orthogonal quantized channel shapes, and furthermore applies these channel shapes as the scheduled 
users' beamforming vectors. 

The joint scheduling and beamforming algorithm is elaborated as follows. First, each member of the 
codebook T, which is a potential beamforming vector, is assigned a user with the maximum SINR. Consider 
an arbitrary vector, for instance vi m \ which is the nth member of the mth orfhonormal subset of the 
codebook T . This vector can be the quantized channel shapes of multiple users, whose indices are grouped 
in a set defined as 1^ = jl < u < U : s u = j where s u is the nth user's quantized channel shape 
given in ©. From ©, Xi m) can be equivalently defined as 

4 m ) = jl < u < U I d f^.vW) < d{s u , v) Vvefandv/ v^} . (6) 

Among the users in X„ , v„ is associated with the one providing the maximum SINR, which is feasible 
since the SNRs are known to the base station through feedback. The index (^h^j and SINR |(i m 'j of 
this user associated with vi" 1 ^ can be written as 

iM = arg max SINRu and = max SINR U , (7) 

3 For simplicity, we assume that the number of feedback bits per user is limited but not the total number of feedback bits from all 
users. Nevertheless, the sum feedback from all users can be reduced by allowing only a small subset of users for feedback, which 
is an topic addressed in a separate paper [36]. 
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where the index set T n m ^ and the function SINR U are expressed respectively in §6§ and (f5]). In the event 
that = 0, the vector v^ 1 ' is associated with no user and the maximum SINR £^ in (0 is set as zero. 
Second, the orthonormal subset of the codebook that maximizes throughput is chosen, whose index is m* = 
argmaxi< m <A/ Yln=i + £n"^J- Thereby, the users associated with this chosen subset, specified by 

the indices |i^ m ^ [ 1 < n < Nt\, are scheduled for simultaneous transmission using beamforming vectors 
from the (m*)fh orthonormal subset. 

The above scheduling algorithm does not guarantee that the number of scheduled users is equal to N t , 
the spatial degrees of freedom. For a small user pool, the number of scheduled users is smaller than Nt. 
This is desirable because it is unlikely to find Nt simultaneous users with close-to-orthogonal channels in 
a small user pool. In this case, having fewer scheduled users than Nt reduces interference and leads to 
higher throughput. As the total number of users increases, the number of scheduled users converges to N t . 
Numerical results on the average number of scheduled users for PU2RC are presented in Section [V] 

Based on the preceding algorithm for joint beamforming and scheduling, the ergodic throughput for 
PU2RC is given as 



R = E 



max 

Km<M 



V log 1 + max SINR U 

n=l V ueik n) J 



(8) 



where SINR U is given in (f5]). The scaling of R with the number of users U as U — > oo is analyzed in 
Section HVl 

IV. Asymptotic Throughput Scaling 

In this section, we derive the scaling laws of the PU2RC throughput for an asymptotically large number 
of users. Auxiliary results required in the analysis are first presented in Section IIV-AI Three SNR regimes, 
namely normal, interference-limited, and noise-limited regimes, are considered in Section IIV-BI to IIV-D1 
respectively. Finally, numerical results showing how the asymptotic throughput scaling laws apply in the 
non-asymptotic regime are presented in Section IIV-EI The asymptotic throughput scaling laws derived in 
this section for downlink SDMA are observed to be identical to those for uplink SDMA [31]. This suggests 
duality between uplink and downlink SDMA in terms of asymptotic throughput. 

A. Auxiliary Results 

Two auxiliary results are provided in this section. In Section IIV-A. 1 1 the theory of uniform convergence 
in the weak law of large numbers is discussed, which is an important tool for the subsequent asymptotic 
throughput analysis. The other useful result related to the channel-shape quantization error is presented in 
Section IW-AT21 
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1 ) Uniform Convergence in the weak law of large numbers: In this section, a lemma on the uniform 
convergence in the weak law of large numbers [30] is obtained by generalizing [37, Lemma 4.8] from R 3 
to C *. This lemma is useful for analyzing the number of users whose channel shapes lie in one of a set 
of congruent disks on the surface of a unit hyper-sphere in C *. 

Lemma 1 (Gupta and Kumar): Consider U random points uniformly distributed on the surface of a unit 
hyper-sphere in G Nt and N disks on the sphere surface that have equal volume denoted as A. Let T n denote 
the number of points belong to the nth disk. For every T\,T2 > 



Pr sup 

\ Kn<N 



<n > i-t 2 , U>U Q (9) 



3 16c 4 2 
U = max <! — log , — log — } (10) 



where 

U = max <; 

ti r 2 ti r 2 
and c is a constant. 

Proof: See Appendix J] □ 

2) Quantization Error of Channel Shape: The complementary CDF of the CSI quantization error e is 
analyzed as follows. As defined in Section UlI-AI e = sin 2 (Z(s, s)) where s and s are the original and the 
quantized channel shapes of an arbitrary user. From the quantization function in ©, the complementary 
CDF of e is 

Pr(e > 5) 



Pr (s<£ [J B s (y)^j , (11) 



where < 5 < 1 and B$(v) = {s € O 4 : |sW| 2 < 5} is a sphere cap on the unit sphere Nt . The CDF 
of e for < 5 < \ has the simple expression as given in the following lemma, but the derivation of CDF 
for | < 5 < 1 is difficult because the sphere caps {^(v) : v G J 7 } overlap. 
Lemma 2: The complementary CDF of e, Pr(e > 5), for < 8 < ^ is given as 

Pr(e >6)=[1- NtS 1 **- 1 )™ , < 5 < \, (12) 

where M is the number of orthonormal bases in the quantization codebook F. In addition, Pr(e > 8) < 
(l_^ t -i) M V0<K1. 

Proof: See Appendix Ull □ 

Next, the following lemma provides an upper-bound for the quantity E[— loge], which is useful for the 

throughput analysis in the sequel. The derivation of this result uses Lemma [2] and [27, Lemma 4]. 

Lemma 3: Given a codebook of M orthonormal bases, the following inequality holds 

logM logiVt logM+1 logNt 

+ T <E[-loge] < — T r ir + ^- T T (13) 



(N t - \)P a N t - 1 - 1 & J " (N t - l)P a N t - 1 



ACCEPTED FOR PUBLICATION IN IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, REVISED ON MAY 3, 2008 



10 



where e is the channel-shape quantization error and 



Pa = 1 



1 - N f 2- {Nt ~ l) 



M 



(14) 



Proof: See Appendix IITTI □ 
B. Normal SNR Regime 

In this section, the throughput scaling law of PU2RC is analyzed for the normal SNR regime, where the 
SINR and throughput are given respectively in (f5]) and ©. As shown in the sequel, in the normal SNR 
regime, the throughput of PU2RC scales double logarithmically with the number of users and linearly with 
the number of antennas. This throughput scaling law is identical to those for ZF-SDMA [20] and OSDMA 
[22]. Therefore, these algorithms all achieve optimal multiuser diversity gain. 

The procedure for deriving the throughput scaling law for PU2RC is to first obtain an upper-bound for 
the throughput scaling factor and second prove its achievability. The achievability proof uses Lemma Q] on 
the uniform convergence in the weak law of large numbers. The above procedure is also adopted for the 
throughput analysis for other regimes in subsequent sections. 

For the normal SNR regime, the throughput scaling factor for PU2RC is upper bounded as shown in the 
following lemma. 

Lemma 4: In the normal SNR regime, the throughput scaling factor for PU2RC is upper bounded as 

R 

lim < 1. (15) 

u^oo N t log log U ~ 

Proof: See Appendix ITVl □ 

Next, the upper-bound in ( fT51 ) is shown to be achievable. Thereby, the throughput scaling law of PU2RC 
in the normal SNR regime is obtained as shown in the following proposition. 

Proposition I: In the normal SNR regime, the throughput scaling law for PU2RC is 

R 

lim = 1. (16) 

u^oo N t log log U 

Proof: See Appendix [V] □ 

The proof uses Lemma Q] on the uniform convergence in the weak law of the large number. As shown in the 
proof, to achieve the throughput scaling law in (fT6l ). the quantization errors and channel power of scheduled 
users must scale with the number of users U as and log U, respectively. This suggests that a scheduler 
for the normal SNR regime should schedule users with both small quantization errors and large channel 
power as U increases. 
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C. Interference-Limited Regime 

In this section, the throughput scaling law of PU2RC is analyzed for the interference-limited regime where 
interference dominates over noise. By omitting the noise term, the SINR in (fS) for the interference-limited 
regime reduces to 

SINRM = 1 (17) 

where the superscript (a) identifies the interference-limited regime. By substituting (fTTT ) into ([8]), the through- 
put for the interference-limited regime is written as 

N t 



= E 



max log ( max — 



(18) 



The scaling law of with U is obtained as follows. 

The upper-bound of the scaling factor of R^ with U is shown in the following lemma. 

Lemma 5: In the interference limited regime, the throughput scaling factor is upper bounded as 

lim -j, < 1. (19) 

Proof: See Appendix [VT] □ 

This proof uses Lemma [3] in Section IIV-A.2I 

Next, the equality in (fT9l is shown to be achievable. The main result of this section is summarized in the 
following proposition. 

Proposition 2: In the interference-limited regime, the throughput scaling law for PU2RC is 

lim Nf - - = 1. (20) 



V-*oo 



jlogU 



Proof: See Appendix [VlTl □ 
Again, this proof makes use of Lemma \T\ on the uniform convergence in the weak law of large numbers. 
By comparing Propositions \T\ and [2j the throughput scales as j^rrf log U in the interference-limited regime 
but Nt log log U otherwise. The reason for this difference is that the asymptotic throughput is determined 
by the channel power (p) in the normal SNR and noise-limited regimes, but by the CSI quantization errors 
(e) of scheduled users in the interference-limited regime. In the normal SNR and noise-limited regimes, the 
asymptotic throughout can be written as iV^Eflogp], where p scales as \ogU due to multiuser diversity gain. 

In the interference-limited regime, the asymptotic throughput is given as A^E[— loge] and the scaling law 

i 

of e is U N f 1 . 

A few remarks are in order. 
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1) The linear scaling factor in (f20l >. namely N t /(N t — 1), is smaller than N t , which is the number of 
available spatial degrees of freedoms. This indicates the loss in multiplexing gain for Nt > 3 in 
the interference-limited regime. Such loss is not observed in the normal SNR (cf. Proposition [TJ or 
noise-limited (cf. Proposition [3]) regimes. 

2) In the interference-limited regime, scheduling users with small channel-shape quantization errors is 
sufficient for ensuring optimal throughput scaling. The reason is that the SINR in (fTTT ) depends only 
on the quantization error. 

3) In the interference-limited regime, the throughput scaling law for PU2RC is identical to that for ZF- 
SDMA [20, Theorem 2% 



D. Noise-Limited Regime 

In this section, the throughput scaling law of PU2RC in the noise-limited regime is analyzed, where noise 
dominates over multiuser interference. By removing the interference term (7/0«e u ) in ([5]), the SINR for the 
noise-limited regime is given as 

SINR(f) = 1Pu {\ - e u ) (21) 

where the superscript specifies the noise-limited regime. By substituting (|2TI ) into ([8]), the corresponding 
throughput is written as 

r N t 



rW = E I max V lo 

I 1 <m< A/f ^ ' 



Km<M ■ 
~ ~ n=l 



1 + max 7p„(l - e u ) 

ueik m) 



(22) 



The scaling law of R^> with U for U — » oo is obtained as shown in the following proposition. 
Proposition 3: In the noise-limited regime, the throughput for PU2RC scales as follows 

lim = 1. (23) 

u^oo N t log log U 

Proof: See Appendix [Ynil □ 

By comparing Proposition Q] and [3l the throughput scaling laws are observed to be identical for both the 
normal SNR and noise-limited regimes. Moreover, as reflected in the proof, to achieve the optimal throughput 
scaling law, scheduled users in the noise-limited regime are required to have channel power scaling as log U 
and quantization errors smaller than a constant d m - m defined in ((42b . Thus, for the noise-limited regime, 
channel power is a more important scheduling criterion than quantization errors. 

4 Note that [20, (45)] gives the throughput scaling law for a single scheduled user. Multiplication of this result with Nt gives the 
identical throughput scaling law for PU2RC as shown in Proposition [2] 



ACCEPTED FOR PUBLICATION IN IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, REVISED ON MAY 3, 2008 13 

E. N on- Asymptotic Regimes 

In preceding sections, the throughput scaling laws for PU2RC are derived for different asymptotic regimes 
characterized by an asymptotically large number of users (U —>■ oo). In this section, these asymptotic scaling 
laws are compared with their counterparts in the non-asymptotic regimes corresponding to a finite number 
of users (U < oo). The purpose of such a comparison is to evaluate the usefulness of the asymptotic results 
derived in previous section for characterizing the throughput of practical PU2RC systems. 

For this purpose, Fig. [2] shows the throughput versus number of users curves for the SNR values of 
{0,5,30} dB, corresponding respectively to the noise-limited, the normal SNR and the interference-limited 
regimes. The range of the number of users is 1 < U < 140, the number of transmit antennas is N t = 2 
and the codebook size is N = 16. The above curves present the PU2RC throughput scaling laws in the 
non-asymptotic regimes. Also plotted in Fig. [2] are the curves defined by the asymptotic throughput scaling 
law Ty^rT 1°§ U for the interference-limited regime (cf. Proposition [2]) and Nt log log U for both the normal 
SNR and the noise-limited regimes (cf. Proposition Q] and [3]). As observed from Fig. |2l as the number of 
users increases, the non-asymptotic curve for SNR = 30 dB becomes parallel to the curve following the 
asymptotic throughput scaling law j^tf log U. Likewise, the non-asymptotic curves for SNR = dB and 
5 dB have the same slopes as the corresponding asymptotic curve defined by N t log log U. Therefore, the 
asymptotic throughput scaling laws also hold in the non-asymptotic regimes. Note that the gaps between 
the asymptotic and non-asymptotic curves are throughput constant factors that become insignificant in the 
asymptotic regimes (U — > oo). 

V. Numerical Results 

In this section, various numerical results are presented. In Section IV-AI the effect of increasing channel 
shape feedback on throughput is investigated. In Section IV-B[ for an increasing number of users, the 
throughput of PU2RC is evaluated against that of ZF-SDMA in [20] as well as the upper bound achieved 
by dirty paper coding (DPC) and multiuser water filling [38]. For simplicity, Assumption [3] is made and 
thus the SINR feedback is assumed perfect for all algorithms in comparison. In Section IV-Cl the capacity 
loss due to the SINR quantization is characterized. 

A. Effect of Increasing Channel Shape Feedback 

For PU2RC, increasing channel shape feedback does not necessarily lead to higher throughput as shown 
in Fig. [3j In Fig. [3j the curves of PU2RC throughput versus the number of users U are plotted for different 
codebook sizes N. The SNR is 5 dB and the number of transmit antennas is Nt = 4. Fig. [3la) and 
Fig. Hb) show the small (1 < U < 50) and the large user ranges (1 < U < 200), respectively. As observed 
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from Fig. Ha), in the range of 4 < U < 22, increasing N decreases the throughput. The reason is that a 
larger codebook size divides the user pool because each user is associated with only one codebook vector (cf. 
Section ITlI-B I ). Consequently, increasing the codebook size reduces the probability of finding scheduled users 
with large channel gains and also associated with the same orfhonormal basis in the codebook. Nevertheless, 
such an adverse effect of increasing the codebook size diminishes as the number of users increases. As 
shown in Fig. for U > 70, a larger codebook size results in higher throughput. The above results motivate 
the need for choosing an optimal codebook size for a given number of users. 

B. Comparison with ZF-SDMA and Dirty Paper Coding 

Presently, PU2RC and ZF-SDMA [4], [6], [20] are two main solutions for multiuser MIMO downlink for 
3GPP-LTE. In this section, their performance is compared using numerical results. Moreover, the throughput 
of PU2RC is evaluated against the upper-bound achieved by dirty paper coding. 

In Fig. |U the throughput of PU2RC is compared with that of ZF-SDMA for an increasing number 
of users. The number of transmit antenna is N t = 4 and the SNR is 5 dB. Moreover, the codebook 
sizes N = {4,8,16,32} for channel shape quantization are considered. As in [20], the threshold 0.25 is 
applied in the greedy-search scheduling for ZF-SDMA. Fig. Ufa) and Fig. Ufb) show respectively the small 
(1 < U < 35) and the large (1 < U < 200) user ranges. As observed from Fig. Ufa), for a given codebook 
size (either N = 16 or N = 64), PU2RC achieves higher throughput than ZF-SDMA for a relative large 
number of users but the reverse holds for a smaller user pool. Specifically, in Fig. Ufa), the throughput curves 
for PU2RC and ZF-SDMA cross at U = 19 for N = 16 and at U = 27 for N = 64. For a sufficiently 
large number of users, PU2RC always outperforms ZF-SDMA in terms of throughput as shown in Fig. Hfb). 
Furthermore, compared with ZF-SDMA, PU2RC is found to be more robust against CSI quantization errors. 
For example, as observed from Fig. Hfb), for U = 100, the throughput loss for PU2RC due to the decrease 
of the codebook size from N = 64 to N = 16 is 0.3 bps/Hz but that for ZF-SDMA is 1.5 bps/Hz. The 
above observations are explained shortly. In summary, these observations suggest that PU2RC is preferred 
to ZF-SDMA for a large user pool but not for a small one. 

To explain the observations from Fig. 01 the average numbers of scheduled users for PU2RC and ZF-SDMA 
are compared in Fig. [5] for an increasing number of users. It can be observed from Fig. [5] that PU2RC tends 
to schedule more users than ZF-SDMA. First, for a small number of users, interference between scheduled 
users can not be effectively suppressed by scheduling, and hence more simultaneous users result in smaller 
throughput. This explains the observation from Fig. @Ia) that PU2RC achieves lower throughput than ZF- 
SDMA due to more scheduled users. Second, for a large user pool, the channel vectors of scheduled users 
are close-to-orthogonal and interference is negligible. Therefore, a larger number of scheduled users leads to 
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higher throughput. For this reason, PU2RC outperforms ZF-SDMA for a large number of users as observed 
from Fig. Hfb). Last, with respect to ZF-SDMA, the better robustness of PU2RC against CSI quantization 
errors is mainly due to the joint beamforming and scheduling (cf. Section IIII-BI ). Note that beamforming 
and scheduling for ZF-SDMA are performed separately [20]. 

Fig. [6] compares the throughput of PU2RC and ZF-SDMA for an increasing SNR. The number of transmit 
antennas is N t = 4 and the codebook size is N = 64. As observed from Fig. [6l for the number of users 
U = 20, PU2RC achieves lower throughput than ZF-SDMA over the range of SNR under consideration 
(0 < SNR < 20 dB). Nevertheless, for larger numbers of users (U = 40 or 80), PU2RC outperforms ZF- 
SDMA for a subset of the SNRs. Specifically, the throughput versus SNR curves for PU2RC and ZF-SDMA 
crosses at SNR=7 dB for U = 40 and at SNR=18 dB for U = 80. The above results suggest that in the 
practical range of SNR, PU2RC is preferred to ZF-SDMA only if the user pool is sufficiently large. 

Fig. E] compares the throughput of PU2RC with an upper bound achieved by dirty paper coding (DPC) 
and multiuser iterative water-filling [38]. A smaller number of antennas Nt = 2 is chosen to reduce the high 
computational complexity of iterative water-filling for a large number of users. Hence, each user has a 2 x 1 
multiple-input-single-output (MISO) channel. Moreover, SNR = 5 dB and the channel shape codebook size 
is N = {2, 4, 8, 16}. As observed from Fig. [71 the gap between the throughput of PU2RC and its upper 
bound narrows as the number of users U or the codebook size N increases. At U = 200 and N = 16, 
PU2RC achieves about 85% of the sum capacity of DPC. 

C. Effect of SINR Quantization 

In this section, using numerical results, a small number of bits for SINR feedback is found sufficient for 
making the capacity loss due to SINR quantization negligible. 

For PU2RC, Fig. [8] compares the cases of perfect and quantized SINR feedback. For quantizing SINR, 
a scalar quantizer using a squared-error distortion function is employed [32]. Moreover, the quantizer has 
a simple codebook containing evenly spaced scalars in the SINR range corresponding to a probability of 
99%. The number of transmit antenna is Nt = 4, the SINR is 5 dB and the codebook size for channel shape 
quantization is N = 16. As observed from Fig. [H 2 bits of SINR feedback per user causes only marginal 
loss in throughput with respect to the perfect SINR feedback. Such loss is negligible for 3-bit feedback. 
Therefore, a few bits of SINR feedback from each user is almost as good as the perfect case, which justifies 
Assumption [3j 

VI. Conclusion 

This paper presents asymptotic throughput scaling laws for SDMA with orthogonal beamforming known 
as PU2RC for different SNR regimes. In the interference limited regime, the throughput of PU2RC is shown 
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to scale logarithmically with the number of users but does not increase with the number of antennas. In the 
normal SNR or noise-limited regimes, the throughput of PU2RC is found to scale double logarithmically 
with the number of users and linearly with the number of antennas at the base station. Numerical results 
showed that PU2RC can achieve significant gains in throughput with respect to ZF-SDMA for the same 
amount of CSI feedback. 

This paper focuses on the scheduling criterion of maximizing throughput. The design and performance 
analysis of PU2RC based on the criterion of proportional fairness is a topic under investigation. Furthermore, 
the optimal deign for PU2RC for the non-asymptotic regime of the user pool remains as an open issue. 

Appendix I 
Proof of Lemma [T] 

Lemma 4.8 in [37] can be generalized from R 3 to C^* as follows. [37, Lemma 4.8] concerns N congruent 
disks on the surface of a sphere in R 3 , and its derivation relies on two results: the first one is the stereographic 
projection [39] that one-to-one maps a point on the surface of the sphere to a point on a plane both in EL 3 ; 
the second is that the Vapnik-Chervonenkis dimension of a set of disks on a plane in R 3 is three [37]. A unit 
hyper-sphere in C^ 4 can be treated as one in R 2iVt [35]. Thereby, the stereographic projection also exists 
between a unit hyper-sphere and a hyper-plane in <D * [40]. Next, following the same procedure as in [37, 
Lemma 4.6], the Vapnik-Chervonenkis dimension of a set of disks on a hyper-plane in € Nt is shown to be 
also three. Based on the two results obtained above for C^*, the remaining steps for proving Lemma [T] are 
identical to those for [37, Lemma 4.8] and are thus omitted. 

Appendix II 
Proof of Lemma [2] 

Since the orfhonormal bases in the codebook T are independently and randomly generated, the comple- 
mentary CDF (fTTb can be equivalently expressed as 

M 

Pr(e > 5) = J] Pr (s £ U veV(m ,£ 5 (v)) , (24) 

m=l 

where y( m ) denotes the mth orthonormal basis in T. Given that s is isotropically distributed on the unit 
sphere, (l24l can be re-written in terms of the volume of sphere caps [32] 

M 

Pr(e > 8) = Yl {1 - vol[U veV(m >£ 5 (v)]} . (25) 

m=l 
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Since the sphere caps {Bs(v)} | v € y( m )} are non-overlapping for 5 < ^ and the volume of each sphere 
cap is vol[-B<5(v)] = e)^' -1 as obtained in [22], we can obtain from (l25l ) 



Pr(e > 5) = Y[ (1 " N^- 1 ) , < 5 < \. 



(26) 



m=l 



The desired result in (TT2l follows from the last equation. Moreover, from (l25l ) and for v£f, Pr(e > 5) < 
11^=1 i 1 ~ vol [ 5 <5( v )]} = ]lm=i I 1 ~ ^ Aft_1 }> wm ch gives the inequality in the lemma. 



Appendix III 
Proof of Lemma [3] 

The minimum of M i.i.d. Beta (N t , 1) random variables, denoted as /3 2 , • • • , /?m}> has the following 
CDF [27] 



From (1271 ) and Lemma [2] 



Pr min /3 m > b) = (1 - b 



N t -1\M 



(27) 



(28) 



l<m<M 

where = represents equivalence in distribution. The above equivalence results in the following equality (a) 

1" 



E 



log ( N^-'e 



< E 

(a) 



E 



< 



(iV^e) |0<e< 2 



log ( min /3 m ) | < min /3 m < -N t 

\l<m<M J l<m<M 2 

E [- log (min m f3 m )} 



Pr( 0<min m /? m < IJV^" 1 



(29) 



As shown in [27, Lemma 4] 



logM 
N t -1 



< E 



log (min/? m 



< 



log M + 1 
JVM ' 



(30) 



By combining (1271) . d29l , and (1301 ). the desired inequality follows. 
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From © and © 



R = E 



< E 



< E 



Appendix IV 
Proof of Lemma H] 

N t 

max > log I max — 

Km<M ^— ' V„eT (r ™> 1 



n=l 



1 +7P« 



max >^ log (1+7 max p., 
> log (1 + 7 max max p u ) 

^ V l<m<Af ue i<-) 7 



log 1 + 7 max p u 

1 Ku<[/ 



The following result is well-known from extreme value theory (see e.g. [22, (A10)]) 

1 



Pr 



max p u — log U 

i<u<u H B 



< 0(loglog[7) > 1-0 



logU 



From 05b and 02 



R < iV^{log[l + 7logC/-70(loglogC/)]}Pr max p u < log C/ - 0(log log C/) 

, Ku<U 



NtE 



log ( 1 + 7 ^2 p u 



u=l 



O 



1 



\ogU 



< AT t £; {log [1 + 7 log C/-70(log log C/)]} + iV t log (1+7^)0 1 ' 



logCJ 



where (a) is obtained by using Jensen's inequality. The desired inequality follows from 

Appendix V 
Proof of Proposition Q] 

Define a set of disks on the unit hyper-sphere as 

£M(cZ) = js G | ||s|| 2 = 1, 1 - IsW^I 2 < d} 1 < m < M, 1 < n < N t 

where d is the radius of B^ n \d). Furthermore, define the user index sets 

= jl < u < U | s u G 0M } l<m<M,l<n<N t 

where the disk B^ is defined in (|35T ). By applying Lemma Q] with t% = T2 = A 
that 



2(logl/)"t- 



Pr |rM|> 



C/ 

(log co 



Af t -1 



> 1 



1 

2 (log I/) 



A 1 , 



— T V u >u 
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where U a is in (fTOb . Let U\ denote a sufficiently large integer such that C 2^ . From © and © and 
by replacing 2n with 7^ 



(a) 

> E 



> E 



} log max 

f-f Wr„ (m) 1 + lPue u J 

1 + 7P« 



,n=l 

' AT, 



y log max . , 

1+7 max ug ^< m ) p u 



iv t 



E lo Mi 



, 17 > Ux 

u>u x 

, U>Ui 



(38) 



u=1 v - + I5jT7 max u6 T«^ / 
where the inequality in (a) holds because u G T^ m ^ =^ e u < according to the definition in (l36l) . From 
d3Vl) and (EH) 



A 1 , 



1 + 7 max ^^-(m) p M \ A , , 



V 1 + ^ max ^ <m,/ ' u , 



U 



1 



1 



V C/ > max(E/i,J7 ). 



2(logC/) 7Vt - 1 
From the last inequality and d33l) , 

log [7 -O (log log tf) 



# > N t E 



log 1 + 



l/T + pogtr + oaogiogcr)]^ 



logC/ . 



1 -o 



1 



AT 4 



V(/> max([/i,C/ ) 



where U 



(logU) N t 



logU / 

-.It follows from the last inequality that 

R 



lim 



> 1. 



C/^oo N t log log [/ 
The desired result is obtained by combining d39l ) and Lemma 0] 



(log 17) 



N t -1 



1 - 



2 (log 17) 



N t -1 



(39) 



Appendix VI 
Proof of Lemma [5] 



From 03 



i?( a > < E 



.n=l 



mm mm e t 

<m<M uEX ( n 



N t E 



log min e u 

' \ !<«<[/ 



(40) 
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In the above equation, mini<„<[/ e u follows the same distribution as the quantization error for an enlarged 
codebook having MU orthonormal bases. Therefore, from (|40T > and Lemma [3] 



R (a) < 



N t 



log U + log M + 1 



+ log N t 



(41) 



N t -1 i _ [i _ Nt 2-{N*-l) 
The desired upper bound of the throughput scaling factor follows from the last inequality. Note that 

{l - [1 - N t 2-( N *-V] MU \ -> 1 as U oo. 

Appendix VII 
Proof of Proposition |2] 

Define the minimum distance of the codebook T as 

1 - IvVl 2 



d mhl = min ■ 



Moreover, similar to (|36"1) . define the index set of the users in the disk Bn (<i m i n ) (cf. (1331 )) as 

= [l<u<U\s u G B^ (d min )} , 1 < m < M, 1 < n < N t . 



(42) 



(43) 



By the definitions in © and (|42l . s u G ^i™' 1 (d m i n ) I mi „. Using this fact, a throughput lower bound 

follows by replacing 1 m ^ n in ([8]) with T m jU 



Aft 



max 7 log I max — 

l<m<M^ V.«GT„ <m) £n 



> E 



N, 



n=l 



> log I max 

^ K ner„ (m » e„ 



(44) 



By applying Lemma [T] with ri = r% = U I and A = d^ in 1 , the numbers of users belonging to the index 
sets ( f36b satisfy 



Pr ( min 



><&- l u-m \>i-u 



where U a is defined in (flOl ). From (l44b and (1431 ) 



log f min ||TM|> fl^U -V* 



V u>u n 



1 - cr 



(45) 



U>U n . 



By applying Lemma [3] 



N, 



N t -1 



log M + log C" 1 + log U + log(l - U") 



+ log N t 



where P a is modified from (fT4)) as 



P„ = 1 - 1 - /y,2 _(JVt_1) 



Mid^U-Ui) 



1-U-), U>U (46) 



(47) 
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It follows from the last inequality that 



lim — w 



> 1. 



(48) 



Combining (148T ) and (fT9l gives the desired throughput scaling law for the interference-limited regime. 



Appendix VIII 
Proof of Proposition [3] 



From (1221 ) and since < e u < 1 

i? (/3) < E 



max 

Km<M 



V" log I 1 + 7 max p u 

n=l V uei^ 



(49) 



In d3TT ) in Appendix [TV] the above upper-bound is also used for bounding the PU2RC throughput in the 
normal SNR regime. Therefore, the upper-bound for the throughput scaling factor as obtained in Appendix 
is also applicable for the present case, hence 



lim < 1. 

c/— oo N t log log U 



(50) 



(m) 



Next, the above upper bond is shown to be achievable as follows. By replacing the index set X\ in (|22 
with its subset TS m ^ defined in (|43l) 



RW > E^log 



(a) 



v n=l 



1 + 7 max p u (l - e u ) 



> E^log 



,n=l 
' N t 



1 + 7(1 - d min ) max p u 



1 + 7(1 - d m i n ) max p u 



U>U 



> E^log 

I 71=1 

> N t E {log [l + 7(1 - d min ) log^d^U - 1) + 7(1 - cUn)0(log log U)\ } f 1 - ^) 



l^|><^-l|(l-I) 



1-0 



1 



log £7 



,u>u 



The inequality (a) follows from the definition of Tn in (I43T ). The inequality (b) follows from (143T ). The 
inequality (c) is obtained by using (|33l ). It follows from (c) that 



lim 



> 1. 



£/->oo AT t log log f7 
Combining d50b and (fSTJ gives the desired throughput scaling law. 



(51) 
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Fig. 2. Comparison between asymptotic and non- asymptotic throughput scaling laws for PU2RC for SNR = {0, 5, 30} dB, the 
codebook size N = 16, and the number of transmit antennas Nt = 2. 
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Fig. 3. Throughput of PU2RC for an increasing number of users U, SNR = 5 dB, and the number of transmit antennas N t = 4. 
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Fig. 4. Throughput comparison between PU2RC and ZF-SDMA for an increasing number of users U, SNR = 5 dB, and the 
number of transmit antennas N t = 4. 
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Fig. 5. The average numbers of scheduled users for PU2RC and ZF-SDMA for SNR = 5 dB, and the number of transmit antennas 

N t = 4. 
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Fig. 6. Throughput comparison between PU2RC and ZF-SDMA for an increasing number SNR; The codebook size = 64 and 
the number of transmit antennas Nt = 4. 
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Fig. 7. Comparison between the throughput of PU2RC and its upper bound achieved by dirty paper coding (DPC) and multiuser 
iterative water-filling for an increasing number of users U, SNR = 5 dB, and the number of transmit antennas Nt = 2. 
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Fig. 8. The effect of SINR quantization for SNR = 5 dB, the number of transmit antennas Nt = 4, and the codebook size for 
channel shape quantization is N = 16. 



